European Journal of Epidemiology — Latest Matching Preprints

1

Direct and mediated effects (DME) SLCMA: a novel method for life course modelling with time-varying covariates

Beer, S.; Simpkin, A. J.; Eldeeb, S. Y.; Zar, H. J.; Stein, D. J.; Dunn, E. C.; Smith, A. D. A. C.

2026-06-06 epidemiology 10.64898/2026.05.29.26354427 medRxiv

Top 0.1%

3.6%

Show abstract

Background: In prospective cohort studies, where an exposure is collected repeatedly, interest often lies in determining whether the timing of that exposure has a differential effect on a later outcome. The Structured Life Course Modeling Approach (SLCMA), where users select between temporal hypotheses of exposure specified a priori, provides one way to analyse such longitudinal data. However, few studies using SLCMA consider the effect of time-varying covariates (TVC) which may impact associations. Methods: We present a modified version of the SLCMA - called direct and mediated effects (DME)-SLCMA - which corrects for TVC. We first develop the DME-SLCMA method, test it through simulation, and apply it to psychosocial data from the Drakenstein Child Health Study (DCHS, n=336) to investigate relationships between maternal psychopathology, TVC of socioeconomic status, and offspring depressive symptoms. Results: We found that, on average, offspring depressive symptoms score increased by 3.9% (95% CI: 1.0%-6.9%, p = 0.039) for each unit of maternal psychopathology (SRQ) at 48 months whilst adjusting for time-varying socioeconomic status (at 18, 30, 42 and 54 months). Our simulations identified several realistic scenarios where selections ignoring TVC - with TVC mediated exposure effects present - were prone to be incorrect, including our DCHS example. Conclusion: DME-SLCMA is a robust new approach for life course modelling in the presence of time-varying covariates. We recommend adjusting for TVC whenever possible, and, when not possible, our simulation study identified that scenarios where mediated effects are comparable, or greater, in magnitude to direct effects are most prone to confounding.

2

Disentangling infectiousness and susceptibility by age group using transmission pair data: a study of SARS-CoV-2 household transmission

Leung, K. Y.; Miura, F.; Backer, J. A.

2026-06-05 epidemiology 10.64898/2026.06.04.26354892 medRxiv

Top 0.2%

3.1%

Show abstract

Background Differential contributions to transmission across age groups have been reported for many respiratory infections, including SARS-CoV-2. They are crucial for estimating the impact of age-specific interventions. Disentangling these age-dependent contributions remains challenging, as they may reflect differences in contact rates, biological susceptibility, or infectiousness. Aim We aim to jointly estimate age-specific per-contact infectiousness and susceptibility and their effect on the impact of age-specific interventions. Methods The age-specific infectiousness and susceptibility were jointly estimated in a Bayesian framework by combining contact data with transmission pair data (who-infected-whom). We applied this approach to 197,840 self-reported household transmission pairs collected in the Netherlands during the COVID-19 pandemic. Using these estimates, we projected the expected impact of school closure and work-from-home measures during the early stages of an epidemic in the absence of other interventions. Results Both infectiousness and susceptibility to SARS-CoV-2 infection were lowest in children aged 0-9 years and highest in adults over 30 years old, with 2- to 4.5-fold differences between these groups. Projected impacts of age-specific interventions indicated that school closures would reduce the reproduction number by 8% or 29% when age-specific susceptibility and infectiousness were or were not considered, respectively. Conversely, working-from-home policies would lead to reductions of 41% with and 20% without age-specific infectiousness and susceptibility. Conclusion Our method enables robust estimation of age-specific infectiousness and susceptibility. Accounting for these age heterogeneities is essential for projecting the impact of age-targeted interventions. Our approach is adaptable to other respiratory infections and can guide more tailored public health responses.

3

Beyond event-rate enrichment: proteomic risk scores for mechanism-aware prevention trial design

Fieggen, J.; Simond, G.; Segal, B. M.; Noori, A.; Thakurta, A.; Butler, C. C.; Clifton, D. A.; Clifton, L.

2026-06-10 health informatics 10.64898/2026.06.09.26355266 medRxiv

Top 0.2%

2.0%

Show abstract

Background. Blood-based biomarkers are increasingly proposed for identifying high-risk individuals before clinical disease and for making prevention-oriented trials more efficient. Prognostic enrichment can increase event rates, but trial efficiency also depends on whether the intervention effect is preserved in the enriched population. Methods. Using the UK Biobank Pharma Proteomics Project, we trained disease-specific proteomic risk scores (ProRS) from 2,916 plasma proteins with elastic-net Cox models. We compared ProRS, polygenic risk scores (PRS), and combined PRS--ProRS scores across ten incident diseases. We estimated cumulative incidence and theoretical two-arm time-to-event trial sample sizes across risk strata. To evaluate effect preservation, we examined six intervention-analogue exposure--outcome pairs spanning genetic (PCSK9/coronary artery disease, APOE/Alzheimer's disease, PPARG/type 2 diabetes, IL23R/Crohn's disease), behavioural (physical activity/all-cause mortality), and pharmacological (RAAS inhibitors versus calcium channel blockers/coronary artery disease) examples. Results. ProRS outperformed PRS for 9 of 10 diseases (median C-index 0.75 versus 0.61). ProRS and PRS were weakly correlated (median Pearson |r| = 0.04), and joint PRS--ProRS stratification identified groups with higher observed incidence than either score alone for several endpoints. In the top risk quartile, combined-score enrichment reduced theoretical required sample sizes by 32--74\% under a fixed 20\% relative hazard reduction. These gains were not always preserved when stratum-specific intervention-analogue effects were used. Effects were broadly preserved for APOE/Alzheimer's disease and physical activity/mortality. The PPARG/type 2 diabetes effect attenuated toward the null under all three score types, showing that event-rate enrichment does not guarantee effect preservation. For IL23R/Crohn's disease and the antihypertensive comparison, point estimates differed across score types -- preserved under polygenic but attenuated under proteomic enrichment -- but confidence intervals were wide and overlapping. Conclusions. Proteomic risk scores can identify high-event-rate populations for prevention-oriented trials, but event-rate enrichment alone is insufficient for trial design. Biomarker-guided enrichment should evaluate mechanism-specific effect preservation and may be preferable as a stratification or adaptive-design variable rather than as a restrictive eligibility criterion.

4

MultiAge: A New Multidimensional Biomarker of Biological Age Derived from Comprehensive Phenotypic and Molecular Profiling

Vetter, V. M.; Junge, M. P.; Ding, G.; Weihs, A. L.; Drewelies, J.; Duezel, S.; Homann, J.; Maetzel, E.-M.; Spira, D.; Grabe, H. J.; Grill, E.; Lindenberger, U.; Nauck, M.; Pawelec, G.; Peters, A.; Steinhagen-Thiessen, E.; Thorand, B.; Voelzke, H.; Winkelmann, J.; Berger, K.; Teumer, A.; Waldenberger, M.; Gerstorf, D.; Lill, C. M.; Bertram, L.; Demuth, I.

2026-06-05 epidemiology 10.64898/2026.06.01.26354629 medRxiv

Top 0.3%

1.9%

Show abstract

Background: It is an everyday observation that people of the same chronological age differ with respect to their physical and mental capacity. However, assessing these differences in biological age remains challenging. Methods: Here, we aggregate 89 age-associated variables from the Berlin Aging Study II (BASE-II, n=1,631) to generate MultiAge, a new marker of biological age that summarizes information from ten domains reflecting organ health and global biological age. We then used methylation data obtained from an Illumina MethylationEPIC array and supervised machine learning to translate MultiAge into a DNA methylation signature, MultiAgeEpi (309 CpGs), which was subsequently validated in four independent external validation cohorts (KORA FF4, KORA Age, SHIP-TREND, BiDirect, total n=4,339). MultiAgeEpi results were compared with previously published epigenetic clocks (GrimAge, DunedinPACE, SystemsAge). Results: We report that MultiAgeEpi showed similar, and in several cases, stronger associations with age-associated outcomes such as diabetes, metabolic syndrome, multimorbidity, frailty and mortality (q < 0.05) compared to the other clocks. Conclusions: MultiAge and MultiAgeEpi thus provide a comprehensive assessment of biological age through aggregation of numerous age-associated variables and the use of the high-resolution methylomics data makes transfer of this marker to other cohorts possible.

5

Cultural engagement and mental disorders: A prospective negative control analysis of the English Longitudinal Study of Ageing with linked Hospital Episode Statistics

Qin, P.; Steptoe, A.; Fancourt, D.

2026-06-08 epidemiology 10.64898/2026.06.05.26354991 medRxiv

Top 0.4%

1.3%

Show abstract

Cultural engagement is associated longitudinally with better mental health and reduced depression incidence, but evidence has largely relied on self-reported symptoms and diagnoses, leaving uncertainty about clinically recorded disorders, and residual confounding remains a concern. Here, we examined whether cultural engagement (including going to cinemas, museums, galleries, exhibitions, theatre, concerts, or opera) predicts hospital-treated mental disorders in 8,274 adults aged 50 years or older from the English Longitudinal Study of Ageing. Participant records were linked to ICD-10 diagnoses in Hospital Episode Statistics and mortality records with follow-up of up to 20 years. In fully adjusted Cox models accounting for sociodemographic, lifestyle, and social factors and multiple testing, frequent cultural engagement was associated with lower risk of any mental disorders (HR 0.71, 95% CI 0.62-0.82, FDR adjusted P value<0.001), dementia (0.71, 0.56-0.89, FDR adjusted P value=0.010), substance misuse (0.75, 0.59-0.95,FDR adjusted P value=0.040), and mood disorders (0.73, 0.56-0.95, FDR adjusted P value=0.044), but not neurotic disorders. Associations persisted after excluding early incident cases and adjusting for baseline depressive symptoms and cognition, and showed robustness to unmeasured confounders. To further probe causality, eye disease, ear disease, and traumatic brain injury, which share similar socio-demographic profiles to mental disorders, were prespecified as negative control outcomes. Cultural engagement was not associated with any negative control outcomes. These findings provide triangulated statistical data to suggest that cultural engagement is associated with reduced risk of several clinically recorded mental disorders and support further testing of cultural engagement as a population mental health strategy.

6

Physical activity, fatty acids, and MASLD risk: Behavioural and metabolic factors jointly shaping liver health in populations

Chen, F.; You, R.; Liu, Y.; Yin, Y.; Liu, A.; Deng, L.; Xie, B.; Fan, J.; Wang, W.

2026-06-08 epidemiology 10.64898/2026.06.05.26354982 medRxiv

Top 0.5%

1.2%

Show abstract

Background and Aims: MASLD has become the most prevalent chronic liver disease globally. Although MVPA and plasma fatty acids have been individually studied in relation to metabolic health, their independent and combined associations with MASLD incidence remain unclear. We aimed to investigate these associations. Methods: This study included 51,717 UK Biobank participants free of liver disease at baseline, with MVPA measured using wrist-worn accelerometers and plasma fatty acids quantified via NMR. Multivariable-adjusted Cox models and restricted cubic splines were used. Results: Over a median follow-up of 7.8 years, 472 incident cases were identified. In fully adjusted models, meeting recommended MVPA levels together with higher n-6 PUFA concentrations was associated with a 71% lower risk (HR 0.29, 95% CI 0.18-0.45). The MVPA-MASLD association was nonlinear, with risk reduction plateauing at approximately 189 minutes per week. Higher n-6 PUFA was associated with reduced risk, whereas n-3 PUFA showed no significant association. Conclusions: These findings suggest that behavioral and metabolic factors may jointly influence MASLD risk. Further studies in diverse populations are needed to confirm these associations.

7

EXHEART: A Fairness-Aware Explainable Stacked Ensemble for Cardiovascular Disease Classification with Cross-Instrument Disparity Attribution

Biswas, M. A.; Laila, A.

2026-06-05 health informatics 10.64898/2026.06.03.26354879 medRxiv

Top 0.5%

1.2%

Show abstract

Background: Machine learning models trained on population health surveys offer scalable tools for cardiovascular screening, but recurring methodological weaknesses undermine their credibility and equity: data leakage from synthetic oversampling, qualitative rather than quantitative explainability evaluation, and the absence of demographic fairness auditing at the clinical operating threshold. Methods: We present EXHEART, a leakage-free stacked ensemble pipeline trained on BRFSS 2015 (n = 253,680) and validated on BRFSS 2020 (n = 319,795; temporal transport and retrain) and a clinical cardiovascular examination dataset (n = 68,730). The pipeline combines XGBoost, LightGBM, Random Forest, and a multi-layer perceptron as base learners with 5-fold out-of-fold logistic regression stacking and Platt scaling calibration. A quantitative SHAP-LIME consistency framework, based on Kendall-tau rank correlation and Jaccard overlap, accompanies a decision-curve analysis, a subgroup-stratified SHAP interaction analysis, and an intersectional fairness audit (Sex x Age x Income) with threshold-shifting mitigation and a frontier of the fairness-utility trade-off. The framework also adds cross-instrument fairness-disparity attribution, an empirical diagnostic that provides evidence on whether an observed subgroup disparity is more consistent with a measurement-induced or a substantive explanation by re-validating it on a dataset that measures the same clinical construct objectively. On heart disease, this diagnostic associates 89% of the sex TPR gap (95% CI [0.65, 0.99]) with the self-reported survey outcome rather than with a substantive risk difference. Results: On BRFSS 2015, EXHEART achieves AUC-ROC = 0.850, AUPRC = 0.371, Brier score = 0.071, and reduces ECE by 96% (0.256 to 0.011) via Platt scaling. Global SHAP-LIME rank agreement is moderate-to-strong (Kendall-tau = 0.580, Spearman-rho = 0.818) with a substantial top-3 divergence (Jaccard@3 = 0.200), where Stroke flips from SHAP rank 8 to LIME rank 1. The Sex TPR gap is 0.124 at the screening threshold; intersectional Sex x Age disparities reach 0.649 among adequately-powered cells, 5.2x the single-attribute gap. Temporal transport to BRFSS 2020 collapses sensitivity from 0.776 to 0.267, while retraining restores AUC = 0.840 and ECE = 0.012. On clinical examination data, the Sex TPR gap collapses to 0.014; the attribution test indicates this gap is instrument-dependent, consistent with a measurement or outcome-definition explanation rather than a substantive risk difference. Cross-domain SHAP analysis identifies four instrument-independent CVD risk factors and two major portability failures. Conclusions: EXHEART combines three practices that population-scale cardiovascular classifiers usually apply in isolation: leakage-free training with calibrated probabilities, a test of whether the model's explanations are stable, and a fairness audit that examines intersecting subgroups rather than single attributes. Bringing them together proved worthwhile. The intersectional audit revealed disparities that single-attribute auditing missed, and the cross-instrument comparison indicated that much of the sex gap reflects how the outcome is measured in survey data rather than a substantive difference in risk. The temporal transport findings indicate that deployed BRFSS models warrant periodic monitoring and retraining to maintain clinical utility. EXHEART is a retrospective methodological evaluation on public de-identified data; it is not validated for direct clinical decision-making, diagnosis, or treatment recommendation without prospective clinical validation.

8

Study Design Indexing in Transition: A Focused Comparison of manual NLM Indexing vs. Transformer-based Automated Models

Das, P.; Schneider, J.; Mayo-Wilson, E.; Kilicoglu, H.; Menke, J. D.; Nam, D.; Ninan, K.; Oberste, J.-P.; Troy, A. M.; Ying, X.; Holt, A. W.; Smalheiser, N. R.

2026-06-04 health informatics 10.64898/2026.06.03.26354854 medRxiv

Top 0.6%

0.9%

Show abstract

Objectives: Study design indexing of biomedical publications is crucial for evidence retrieval and synthesis. We sought to evaluate the accuracy and suitability of a transformer-based model (TM) for indexing clinical study designs, in comparison to National Library of Medicine (NLM) indexing. However, this is challenging for at least three reasons: First, to date, all automated systems have been trained and evaluated on manual NLM indexing assignments, itself subject to errors. Second, TM's probabilistic predictive scores take into account uncertainty, and can be converted to TRUE/FALSE assignments in different ways depending on the needs of users, while NLM labels are categorical. Third, our goal (to tag articles only that exhibit a given design) differs from NLM which tags articles that both discuss as well as exhibit that design. Materials and Methods: Therefore, we carried out a limited evaluation of the TM model that focuses only on the articles that received the most confident predictions, that is, the highest scores that are almost certainly TRUE and the lowest scores that are almost certainly FALSE, but which disagreed with NLM assignments. This was performed both for articles published in 2016 (when NLM decisions were manual) and in 2025 (when NLM decisions were automated). To establish ground truth, dual annotators indexed the articles independently, following written definitions, for four prominent study designs--cohort, case-control, cross-sectional, and case report. Results: For three designs (case-control, case report, cross-sectional), the articles having the top 100 predictive TM scores (when NLM failed to assign that design) were judged to exhibit that design in the great majority (86-100%) of cases. Conversely, the articles having the lowest 100 predictive TM scores (when NLM did assign the study design) exhibited the design only in relatively few (0-21%) of cases. The most confident predictions of the TM model were highly accurate and not redundant with automated NLM indexing; the exception was cohort studies articles, in which both TM and NLM labels showed high error rates of both omission and commission. Discussion and Conclusion: TM may have value for identifying articles exhibiting study designs, which is especially important for clinical decision-making as well as systematic reviews and other evidence syntheses. NLM indexing of cohort studies cannot be regarded as a reliable gold standard for training or evaluation of automated systems, warranting efforts to create a new manually annotated corpus.

9

Mortality in people with attention-deficit/hyperactivity disorder (ADHD): Examining how risk is embodied in a pooling of two prospective cohort studies

Li, H.; Ford, T.; Warrier, V.; Bell, S.; Batty, G. D.

2026-06-09 epidemiology 10.64898/2026.06.08.26355148 medRxiv

Top 0.7%

0.8%

Show abstract

Background. Nascent findings suggest that people with attention-deficit/hyperactivity disorder (ADHD) experience higher rates of mortality. To date, study samples have been insufficiently well-characterized to examine the mechanisms via which this neurodevelopmental condition elevates mortality risk. Methods. We used data from the 2007 and 2011 waves of the US National Health Interview Survey, a general population-based cohort study comprising 52097 adults (28675 women) aged 18 years or older at baseline. ADHD diagnosis and an array of demographic, socioeconomic, lifestyle, and co-morbidity (somatic and psychiatric) covariates were self-reported. Findings. At baseline, compared with unaffected individuals, participants with ADHD were more likely to be socioeconomically disadvantaged, smoke cigarettes, consume alcohol, and report symptoms of psychological distress. A median 7.75 years of mortality surveillance (range: 7.25-12.25) gave rise to 6597 deaths from all-causes. After adjustment for age, sex, ethnicity, and survey year, ADHD was associated with a markedly elevated risk of death (hazard ratio [95% confidence interval]: 1.58 [1.20-2.09]). Statistical adjustment for socioeconomic circumstances (11% attenuation), physical co-morbidities (15%), and lifestyle factors (17%) had only a modest impact on the ADHD-death gradient, with the greatest explanatory power apparent for symptoms of depression and anxiety (58%). The magnitude of the association of ADHD with mortality was commensurate to that for several well-established risk factors such as poverty (1.66 [1.55-1.78]), hypertension (1.41 [1.32-1.51]), and diabetes (1.71 [1.59-1.85]) but somewhat lower than cigarette smoking (2.51 [2.29-2.76]) after controlling for age, sex, ethnicity, and survey year. Associations between ADHD and cause-specific mortality from cardiovascular disease, cancer, and chronic respiratory disease were inconclusive. Interpretation. In the present study, the influence of ADHD on total mortality appears to be largely embodied via a series of malleable characteristics, particularly mental illness. If confirmed elsewhere, these results raise the possibility that risk factor modification via standard pharmacological and behavioral interventions could help reduce rates of premature mortality in this patient group. Funding. This paper received no direct funding. GDB is supported by the UK Medical Research Council (MR/P023444/1) and the US National Institute on Aging (1R56AG052519-01, 1R01AG052519-01A1).

10

Arts and Cultural Engagement and Multidimensional Well-being in Later Life

Noguchi, T.; Erhua, S.; Hayashi, T.

2026-06-04 epidemiology 10.64898/2026.06.02.26354582 medRxiv

Top 0.7%

0.7%

Show abstract

Background and Objectives: Arts and cultural engagement may contribute to well-being in later life. However, evidence from longitudinal studies from Asia, including Japan, remains limited. This study examined the association of arts and cultural engagement with subsequent multidimensional well-being among older adults in Japan, one of the fastest-aging countries. Research Design and Methods: This longitudinal study used panel data from 354 individuals aged 60 and older (mean age 74.0 years; 78.6% women) who completed self-administered questionnaires by mail between 2022 and 2024. The PERMA-Profiler was used to assess five multifaceted aspects of psychological well-being: positive emotion, engagement, relationships, meaning, and accomplishment. Frequencies of arts and cultural engagement at baseline were measured for active (e.g., activities by individuals and participation in groups, such as music and painting) and receptive (e.g., visiting museums, galleries, and theaters) forms. Results: Multivariable linear regression analysis, adjusted for the covariates including baseline PERMA scores, showed that higher frequencies of active engagement were positively associated with higher PERMA scores for all domains. Higher frequencies of receptive engagement were associated with the domains of positive emotion, meaning, and accomplishment, but not clearly associated with engagement and relationships. Restricted cubic spline analyses suggested clearer positive frequency-response patterns for active engagement than for receptive engagement. Discussion and Implications: Arts and cultural engagement, both active and receptive forms, was associated with subsequent multiple aspects of well-being in later life. These findings suggest the importance of ensuring access to arts and cultural opportunities for older adults to create, participate, and connect.

11

Age-specific burden of medically attended respiratory virus disease in high-income countries: a scoping review and meta-analysis

Gupta, M.; Zoega, H.; Stopard, I. J.; Liu, B.; Macartney, K.; Wood, J. G.; Hogan, A. B.

2026-06-10 epidemiology 10.64898/2026.06.09.26354660 medRxiv

Top 1%

0.5%

Show abstract

Introduction: Respiratory infections are a leading cause of morbidity. Newly available vaccines to prevent respiratory syncytial virus (RSV) disease and encouraging clinical progress on vaccines for human metapneumovirus (hMPV) and parainfluenza (PIV) could reduce the disease burden beyond existing influenza and SARS-CoV-2 immunisation programs. However, evidence on the contribution of these viruses to respiratory disease burden across the lifespan remains limited. Methods: We reviewed studies from 01/2002-11/2025 reporting age-stratified, medically attended cases of influenza, and at least one of RSV, hMPV, or PIV, in high-income countries, excluding periods substantially overlapping with the COVID-19 pandemic. Using only studies that tested for all four viruses, we estimated the age-specific proportion of cases that were non-influenza (total across RSV, hMPV and PIV) compared to influenza using a mixed-effects logistic regression model. Results: Following exclusions and screening, 61 studies were included in the primary analysis comprising >500,000 detections of the four viruses. We found that a substantial proportion of medically attended respiratory illness in infants and young children was due to PIV, hMPV and RSV, rather than influenza, with a non-influenza virus proportion of 90.2% (95% CI 85.9-93.2%) in young infants aged 0-6 months. The converse was true for school-aged children, with a non-influenza virus proportion of 34.8% (95% CI 26.5-44.2%) in children aged 5-18 years. In adults aged 65+ years, non-influenza causes of medically attended disease were common at 60.2% (95% CI 50.0-69.5%). Restricting to studies reporting hospitalised cases (n=19) produced broadly similar age-specific trends in relative virus burden contributions. Discussion: We highlight the significant burden of medically attended illness due to PIV, hMPV and RSV across ages, particularly in infant and preschool-aged children and older adults, supporting the need for effective vaccines targeting this burden.

12

Education/training for health workers/students on inclusive and gender-affirmative care for trans and gender-diverse people: a systematic review

Xia, J.; Zhu, Z.; Zhang, G.; Shen, Q.; Su, E.; Schoones, J.; Arcelus, J.; Hu, T.; Xu, M.; Zhang, X.; Zhao, Z.; Ye, Z.; Yao, X.

2026-06-05 health policy 10.64898/2026.06.04.26354880 medRxiv

Top 1%

0.5%

Show abstract

Introduction: Trans and gender-diverse (TGD) individuals often face stigma and discrimination in healthcare, hindering access to gender-affirming care. Training healthcare workers on TGD health aims to foster inclusive and affirming care practices. This review aimed to evaluate the effectiveness of TGD health training programs for healthcare workers. Methods: This systematic review followed the PRISMA guidelines and was registered with PROSPERO (CRD42023443288). We searched 13 databases for studies up to March 2024, with no language/geographic restrictions. Ten reviewers screened studies in pairs, resolving discrepancies via discussion or third-reviewer input. We included randomized/non-randomized comparative and before-after studies for quantitative analysis (mean difference [MD] or standardized mean difference [SMD] with 95% CIs) and qualitative/mixed-methods studies for thematic synthesis. Evidence certainty was assessed using GRADE (quantitative) and GRADE-CERQual (qualitative). Outcomes included knowledge, attitudes, skills, discrimination, competence, comfort, TGD quality of life, and stakeholder preferences. Results: From 20,188 records, 85 studies were included. Training appears to have improved healthcare workers' knowledge (SMD=1.08, 95% CI 0.78-1.39), attitudes (SMD=0.22, 95% CI 0.05-0.39), skills (SMD=0.96, 95% CI 0.56-1.37), competence (SMD=0.55, 95% CI 0.29-0.81), and comfort (SMD=0.69, 95% CI 0.17-1.21). Qualitative analysis of 130 findings identified 18 categories and four key themes on intervention design and impact. Conclusions: TGD training programs may enhance health workers' knowledge, attitudes, skills, competence, and comfort. Well-structured, interactive, and inclusive programs showed promise, but evidence certainty was low with limited follow-up. Further high-quality research is needed to confirm these findings.

13

Early life multidimensional disadvantage of South Australian children: a whole-population linked data study

Kalamkarian, A.; Pilkington, R. M.; Lynch, J.; Mittinty, M. N.; Malvaso, C.; Hawkins, K.; Pharo, H.; Beck, K.; Chittleborough, C. R.

2026-06-05 epidemiology 10.64898/2026.06.03.26354860 medRxiv

Top 1%

0.4%

Show abstract

Background: Whole-population linked administrative data platforms provide an opportunity to generate evidence on early life multidimensional disadvantage to inform resourcing and service provision to families with complex needs. Methods: We used individual-level de-identified data from nine administrative data sources included in the Better Evidence Better Outcomes Linked Data (BEBOLD) platform. The population included all children born in South Australia between 2004-2011 (n=143,083), and their parents. We described the prevalence and distribution of multiple disadvantages affecting children from the 12 months before birth to age 5. Eleven domains of parental disadvantage were created: economic, education, access to services, mental health, substance misuse, smoking during pregnancy, domestic and family violence, health, child protection contact, justice system contact, and death. We investigated the concordance of our measure with an area-level socioeconomic measure used in government reporting. Results: One in two children (48%) were exposed to at least one disadvantage domain, and one in seven (14%) were exposed to three or more domains before age five. Economic disadvantage was most prevalent, affecting one in four (27%) children, of which 75% were exposed to additional forms of disadvantage. Substance misuse, domestic and family violence, and justice system contact were the least likely domains to occur in isolation. Only 54.4% who experienced five or more disadvantage domains were classified in the area-level socioeconomic measure's 'most disadvantaged' quintile. Conclusion: Early life exposure to parental disadvantage can be highly multidimensional. Measurement across different systems is important for informing coordinated service provision for families with complex needs.

14

Cardiovascular-Kidney-Metabolic Syndrome Among US Adults, 1999-2023: National Trends and Projections Through 2050

Fu, F.; Wei, A.; Wang, G.; Fang, S.; Chen, J.; Liu, W.; Liu, H.; Gao, X.; Lei, Y.; Guo, N.; Chen, M.; Yu, J.; Wang, Y.; Li, S.; Mao, Y.; Yan, L.

2026-06-10 health systems and quality improvement 10.64898/2026.06.08.26355220 medRxiv

Top 1%

0.4%

Show abstract

Background Cardiovascular-kidney-metabolic (CKM) syndrome integrates adiposity, metabolic risk, kidney dysfunction, and cardiovascular disease in a prevention-oriented framework. National estimates across 1999-2023 NHANES and future burden remain limited. Methods We analyzed US adults aged 20 years from 11 NHANES cycles, 1999-2000 through August 2021-August 2023. CKM stage 0-4 was assigned using harmonized examination, laboratory, medication, and questionnaire data. Prevalence was survey-weighted and standardized to the 2010 US Census adult population. Decade trends used survey-weighted logistic regression adjusted for age, sex, and race and ethnicity. Exploratory 2040 and 2050 projections combined NHANES prevalence models with US Census projections under population-aging-only, trend-continuation, and risk-improvement scenarios. Results Among 62,890 eligible adults, 62,888 had sufficient CKM data. In 2021-2023, age-standardized prevalence was 87.9% (95% CI, 86.5%-89.4%) for CKM stage 1 and 62.0% (95% CI, 60.1%-63.8%) for stages 2-4. Stage 2 accounted for 50.1% (95% CI, 48.2%-51.9%) and stages 3-4 for 11.9% (95% CI, 11.0%-12.7%). From 1999-2000 to 2021-2023, any CKM increased by 4.6 percentage points (95% CI, 2.4 to 6.9; P<0.001), whereas stages 2-4 changed by 2.1 percentage points (95% CI, 5.1 to 0.8; P=0.156). In adjusted decade models, any CKM increased (OR, 1.28; 95% CI, 1.19-1.38; P<0.001), while stages 2-4 showed no significant linear trend (OR, 0.95; 95% CI, 0.89-1.01; P=0.084). Excess adiposity and diabetes increased, dyslipidemia declined, and hypertension, chronic kidney disease, and clinical cardiovascular disease were stable. With population aging alone, projected stages 2-4 burden rose from 164.8 million adults in 2023 to 193.7 million in 2050; under risk improvement, it was 147.7 million. Conclusions CKM syndrome remained highly prevalent among US adults. Although later stages did not increase significantly, population aging may expand the absolute care burden unless broad risk improvement occurs.

15

Development of Longitudinal, Linked Maternal-Infant Cohorts using the Epic Cosmos Electronic Health Record Dataset

Leonard, S. A.; Dysart, K.; Callahan, A.; Siadat, S.; Zhang, J.; Handley, S. C.; Huybrechts, K. F.; Igbinosa, I.; Bateman, B. T.

2026-06-04 epidemiology 10.64898/2026.06.02.26354757 medRxiv

Top 1%

0.3%

Show abstract

Background: Epic Cosmos is a relatively new centralized electronic health record dataset with high potential utility in perinatal epidemiologic research. Objectives: The study objectives were to develop replicable steps to create longitudinal, linked maternal-infant cohorts in Cosmos, assess completeness of key variables, evaluate potential selection bias with restrictions for longitudinal healthcare encounters, and provide an example epidemiologic analysis. Methods: We created maternal-infant cohorts by starting with live births during 2023-2024 recorded in the BirthFact data table and joining with additional data tables as needed. We selected and created variables for perinatal characteristics, common comorbidities, and routinely measured vital signs and laboratory values, and assessed variable completeness. We sequentially restricted the birth cohort for maternal-infant linkage and longitudinal healthcare from first-trimester prenatal care encounter through infant follow-up care within 12 weeks post-discharge from birth hospitalization. Finally, we conducted an example analysis of the association between high systolic blood pressure in the first trimester ([≥]140 mm Hg) and later onset of preeclampsia among those with chronic hypertension. Results: The total linked birth cohort included 2,624,186 pregnancies. Completeness was >90% for most variables assessed but was 77% for racial and ethnic group and 76% for body mass index at delivery. Characteristics of the cohort were similar to those reported for the entire United States birth population based on birth certificate data, including similar regional and racial-ethnic composition. Longitudinal cohort restriction requiring linked records from first trimester prenatal care through infant follow-up care reduced the cohort size to 509,148 pregnancies. However, restriction had minimal effects on cohort characteristics. In the example analysis, high systolic blood pressure was associated with increased risk of preeclampsia among those with chronic hypertension (aRR: 1.26; 95% CI: 1.22, 1.30). Conclusions: This study provides a rigorous and reproducible approach to creating longitudinal, linked maternal-infant cohorts in Epic Cosmos and the analytical findings suggest high data quality and representativeness.

16

A New Mixed Frequency Regression Model For Environmental Epidemiology

Shukla, N.; Bartington, S. E.; Hansell, A. L.; Lucas, T. C.

2026-06-04 epidemiology 10.64898/2026.06.03.26354801 medRxiv

Top 1%

0.3%

Show abstract

Background: In the absence of high-resolution response data, exposure-response modelling often relies on aggregated low-frequency exposure data, leading to loss of high-resolution information. Mixed Data Sampling (MIDAS) from econometrics offers an alternative but is limited due to its inability to make high-resolution predictions, inflexible likelihoods and penalised nonlinear functions, and limited visualization options. We propose a mixed-frequency Distributed Lag Non-linear Model (mf-DLNM) which can eliminate the need to aggregate exposure data in environmental epidemiology and provide high resolution predictions for time series studies. Methods: We evaluated the inference and predictive performance of the mf-DLNM. To evaluate its ability to estimate exposure-response relationships, we applied mf-DLNM and same-frequency (sf)-DLNM using data from the West Midlands, UK. Additionally, we compared the predictive performance of mf-DLNM with sf-DLNM and MIDAS across nine regions of England. As MIDAS cannot predict at the resolution of the predictor (daily), we compared the predictive performance of mf-DLNM and MIDAS at weekly resolution. To test the model's ability to predict high temporal resolution risk (daily), we compared sf-DLNM (with access to daily mortality counts) with mf-DLNM (with access only to weekly mortality counts). Results: In the West Midlands example, mf-DLNM performed comparably to sf-DLNM in estimating daily risk of temperature on respiratory mortality. Furthermore, mf-DLNM and MIDAS exhibited similar performance for weekly predictions. For high-resolution predictions, mf-DLNM and sf-DLNM showed nearly similar performance, despite mf-DLNM having access only to low-resolution response data. Conclusion: This mixed-frequency approach in environmental epidemiology overcomes the limitations of predicting health risks using aggregated exposure data and provides estimates of high-resolution outcomes in the absence of high-frequency health outcome datasets.

17

Context-Dependent Age-Group performance hierarchies limit fairness interventions in PPG-based heart rate prediction

Panchumarthi, L. Y.; Kataria, S.; Wu, Y.; Hu, X.; Fedorov, A.; Kwak, H. G.

2026-06-05 health informatics 10.64898/2026.06.04.26352929 medRxiv

Top 2%

0.3%

Show abstract

Background. Fairness-aware machine learning increasingly targets demographic performance disparities in clinical prediction, yet whether standard bias mitigation strategies genuinely improve equity in physiological signal analysis remains unclear. Age-based disparities in photoplethysmography (PPG)-based heart rate prediction present a particular challenge, as age-related performance differences may reflect context-dependent physiological structure rather than correctable artifacts. Methods. We evaluated three fairness interventions, inverse-frequency weighting (IF), Group Distributionally Robust Optimization (GroupDRO), and adversarial debiasing (ADV), applied via fine-tuning of a PPG foundation model across three clinical datasets spanning intensive care unit, laboratory, and consumer wearable contexts. Outcomes were assessed using a 2x2 framework classifying each intervention-dataset combination by the joint direction of change in mean absolute error (MAE) and fairness gap (FG) across age groups, yielding four outcome types: genuine improvement (G), leveling down (L), selective benefit (S), and both worse (W). Results. Across nine intra-domain conditions, no intervention simultaneously improved both MAE and FG (0/9 genuine improvement). The dominant pattern was leveling down (5/9): FG decreased but was accompanied by MAE degradation, indicating that apparent fairness gains were achieved at the cost of overall predictive performance. Age-group difficulty ordering varied across clinical contexts at baseline and was not preserved under intervention. In 18 cross-domain transfer conditions, genuine improvement was rare (4/18) and observed exclusively in non-MIMIC source configurations; models fine-tuned on MIMIC-sourced data yielded no genuine improvements (0/6). Embedding-level representation changes following fine-tuning did not reliably predict fairness outcomes. Conclusions. Age-based fairness interventions in PPG heart rate prediction indicate a leveling-down pattern rather than genuine equity improvement, suggesting that age-related performance gaps reflect context-dependent physiological structure not fully addressable through standard bias mitigation. Cross-domain transfer further amplifies this instability. These findings suggest that fairness evaluation frameworks for age-stratified physiological prediction should account for context-dependent performance structure rather than treating observed gaps as correctable bias.

18

Large Language Models in Healthcare Simulation Education: A Bibliometric Analysis with AI-Assisted Screening

Pears, M.; Wadhwa, K.; Payne, S. R.; Konstantinidis, S. T. H.; Biyani, C. S.

2026-06-04 urology 10.64898/2026.06.02.26354722 medRxiv

Top 2%

0.2%

Show abstract

Large language models (LLMs) such as ChatGPT are rapidly reshaping healthcare education and simulation-based training in non-technical skills (NTS), yet no bibliometric analysis has mapped this landscape. We searched seven open-access databases (OpenAlex, PubMed, Europe PMC, Crossref, Semantic Scholar, CORE, DOAJ) for English-language publications from January 2020 to March 2026. From 100,277 initial records, a sequential keyword funnel yielded 830 candidate papers, which were screened by 83 independent Claude Sonnet 4.6 AI agents applying pre-specified inclusion criteria (PRISMA-trAIce compliant; Cohen's kappa = 0.86 pre-reconciliation, 1.0 post-reconciliation). The final AI-verified corpus comprised 551 papers with a compound annual growth rate of 109%, contributions from 2,398 authors across 279 journals in 58 countries, and an h-index of 41. ChatGPT dominated the model landscape (46% of papers), with open-source models virtually absent. Virtual patient chatbots were the leading simulation modality (106 papers). Among NTS domains, communication (145 papers) and decision-making (135 papers) were most studied, whereas teamwork, leadership, situational awareness, and crisis resource management were markedly underrepresented. Only 6 urology-relevant papers were identified, none examining LLM integration within boot camp training formats. The field is growing at extraordinary pace but remains concentrated in a narrow range of NTS domains and a single proprietary model. Critical gaps persist in team-based skills training, open-source model evaluation, and specialty-specific simulation. AI-assisted bibliometric screening using multiple independent agents is feasible, reliable, and scalable, offering a replicable methodology for mapping fast-evolving research fields.

19

Integrated cardiometabolic and nutritional risk profiling identifies pregnancy loss as a marker of systemic metabolic vulnerability

Agarwal, T.; Namburu, J. R.; Kachroo, P.

2026-06-08 epidemiology 10.64898/2026.06.04.26354910 medRxiv

Top 2%

0.2%

Show abstract

Background: Pregnancy loss has important implications for womens health. Although maternal age is a well-established risk factor, the contribution of routinely measured cardiometabolic and behavioral markers at population-scale remains incompletely characterized. Objective: To examine associations between cardiometabolic, nutritional, and behavioral risk markers and pregnancy loss among U.S. women of reproductive age. Methods: We conducted a cross-sectional analysis of 4,842 U.S. women aged 20-44 years with [≥]1 pregnancy using the National Health and Nutrition Examination Survey data (2013-2023). Pregnancy loss was defined as [≥]1 prior miscarriages. Exposures included body mass index, smoking exposure (cotinine), lipid biomarkers, vitamin D and folate, and a composite cardiometabolic-nutritional risk score. Survey-weighted logistic regression estimated adjusted odds ratios (aORs) and 95% confidence intervals, with bootstrap resampling for predictor robustness. Results: The weighted prevalence of pregnancy loss was 23%. Higher odds of pregnancy loss were associated with increasing age (aOR per year=1.02; 95% CI: 1.00-1.04), Non-Hispanic Black race (aOR=1.32; 95% CI: 1.00-1.74), overweight (aOR=1.56; 95% CI: 1.16-2.11), obesity (aOR=2.06; 95% CI: 1.39-3.05), and smoking (aOR=1.58; 95% CI: 1.19-2.10). Adverse lipid profiles, particularly elevated triglycerides (aOR=1.83; 95% CI: 1.16-2.90) and high low-density lipoprotein (aOR=2.97; 95% CI: 1.45-6.61), were independently associated with pregnancy loss. Vitamin D/folate were not stable predictors. Higher composite cardiometabolic-nutritional risk scores were observed among women with pregnancy loss (P=0.026). Conclusion: Pregnancy loss clustered with adverse cardiometabolic and behavioral risk markers in a nationally representative population. These findings highlight pregnancy loss as a marker of broader metabolic vulnerability supporting the need for longitudinal studies and cardiometabolic profiling to inform preconception care and risk stratification.

20

Clonal Hematopoiesis of Indeterminate Potential Refines Cardiovascular Risk Stratification in Cardiovascular-Kidney-Metabolic Syndrome Stages 0-3

Lu, J.; Sun, S.; Deng, Z.; Wang, S.; Wei, C.; Jiang, S.; Li, W.

2026-06-08 epidemiology 10.64898/2026.06.04.26354963 medRxiv

Top 2%

0.2%

Show abstract

Background: Chronic low-grade inflammation drives cardiovascular-kidney-metabolic (CKM) syndrome. Clonal hematopoiesis of indeterminate potential (CHIP), an age-related driver of systemic inflammation, is linked to several cardiometabolic disorders. However, whether CHIP modifies CKM progression and contributes to heterogeneity in cardiovascular disease (CVD) risk within the CKM framework remains uninvestigated. Methods: This cohort study included 307,025 UK Biobank participants at CKM stages 0-3 free of baseline CVD. CHIP status was identified via whole-exome sequencing (WES). The association between CHIP and baseline CKM severity was examined, along with the independent and joint effects of CHIP and CKM stages on incident CVD risk. The joint effects of CHIP and polygenic risk scores (PRS) were further assessed, and the incremental predictive value of incorporating CHIP into the AHA PREVENT equations was evaluated. Results: CHIP carriers were more likely to present with advanced CKM stages [OR 1.14 (1.09-1.20), P < 0.001] and exhibited higher incident CVD risk during follow-up [HR 1.13 (1.08-1.18), P < 0.001]. Significant joint effects between CHIP and CKM stages were observed, with the highest risk among CHIP carriers at CKM stage 3 [HR 1.63 (1.50-1.78), P < 0.001]. Large or multiple CHIP mutations conferred greater hazards, with distinct gene-specific effects observed. Moreover, CHIP and high genetic risk also jointly amplified CVD susceptibility. Most importantly, incorporating CHIP into AHA PREVENT significantly improved risk discrimination. Conclusions: CHIP is a significant risk factor associated with more advanced CKM stages and amplifies incident CVD risk. Integrating CHIP into existing prevention strategies may refine CVD risk stratification.